Examining Talker and Phoneme Generalization of Dimension-Based Statistical Learning in Speech Perception
نویسندگان
چکیده
Speech perception flexibly adapts to short-term regularities of the ambient speech input. Recent research demonstrates that the function of an acoustic dimension for speech categorization at a given time is relative to its relationship to the evolving distribution of dimensional regularity across time, and not simply to its fixed value along the dimension. Two studies examine the nature of this dimension-based statistical learning in online word recognition, testing generalization of learning across talkers and across phonetic categories. The results indicate that dimension-based statistical learning is specific to the experienced regularities, resisting transfer across talkers or phonetic categories.
منابع مشابه
Visual phonemic ambiguity and speechreading.
PURPOSE To study the role of visual perception of phonemes in visual perception of sentences and words among normal-hearing individuals. METHOD Twenty-four normal-hearing adults identified consonants, words, and sentences, spoken by either a human or a synthetic talker. The synthetic talker was programmed with identical parameters within phoneme groups, hypothetically resulting in simplified ...
متن کاملAcoustic differences, listener expectations, and the perceptual accommodation of talker variability.
Two talkers' productions of the same phoneme may be quite different acoustically, whereas their productions of different speech sounds may be virtually identical. Despite this lack of invariance in the relationship between the speech signal and linguistic categories, listeners experience phonetic constancy across a wide range of talkers, speaking styles, linguistic contexts, and acoustic enviro...
متن کاملAuditory}visual integration of talker gender in vowel perception
The experiments reported here used auditory}visual mismatches to compare three approaches to speaker normalization in speech perception: radical invariance, vocal tract normalization, and talker normalization. In contrast to the "rst two, the talker normalization theory assumes that listeners' subjective, abstract impressions of talkers play a role in speech perception. Experiment 1 found that ...
متن کاملConsonant confusion structure based on machine classification of visual features in continuous speech
This study is a first step in selecting an appropriate subword unit representation to synthesize highly intelligible 3D talking faces. Consonant confusions were obtained with optic features from a 320-sentence database, spoken by a male talker, using Gaussian mixture models and maximum a posteriori classification methods. The results were compared to consonant confusions obtained from visual-on...
متن کاملLexically guided phonetic retuning of foreign-accented speech and its generalization.
Listeners use lexical knowledge to retune phoneme categories. When hearing an ambiguous sound between /s/ and /f/ in lexically unambiguous contexts such as gira[s/f], listeners learn to interpret the sound as /f/ because gira[f] is a real word and gira[s] is not. Later, they apply this learning even in lexically ambiguous contexts (perceiving knife rather than nice). Although such retuning coul...
متن کامل